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Abstract 

Recently, whole genome sequencing approaches have pinpointed mutations in genes that were previously not 
associated with cancer. For Acute Myeloid Leukaemia (AML), and other myeloid disorders, these approaches 
revealed a high prevalence of mutations in genes encoding the chromosome cohesion complex, cohesin. Cohesin 
mutations represent a novel genetic pathway for AML, but how AML arises from these mutations is unknown. This 
review will explore the potential mechanisms by which cohesin mutations contribute to AML and other myeloid 
malignancies. 
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Introduction 

The development and pathology of acute myeloid leukemia 
(AML) can be caused by a number of genetic alterations, 
although the molecular basis of AML is not yet thoroughly 
understood. Chromosomal translocations and variations 
such as t(15;17), t(8:21), inv(16), t(9;21), t(9;ll) are charac- 
teristic of AML, and suggest that genetic events play a key 
role in leukemogenesis [1]. However, nearly 50% of AML 
cases have a normal karyotype and lack major chromosome 
abnormalities. In an effort to elucidate the genetic basis of 
these cases, next-generation genome sequencing methods 
have been successfully used in recent years to identify many 
novel leukemogenic genes [2]. From these analyses, recur- 
rent mutations in genes encoding subunits of the cohesin 
complex emerged in AML genomes. Several studies have 
now revealed that mutations in the cohesin complex are 
strongly associated with AML, and furthermore, that cohe- 
sin mutations are also found at high frequency in other 
related myeloid malignancies. Cohesin mutations could 
therefore represent a potential new molecular mechanism 
underpinning oncogenesis. Cohesin has multiple functions, 
including roles in cell division, nuclear architecture, DNA 
damage repair, development and transcription, and these 
functions have been the subject of several comprehensive 
recent reviews [3-9]. In this concise, focused review, we will 



discuss molecular functions of cohesin that have potential 
to influence the etiology and progression of AML and other 
myeloid malignancies. 

Cohesin biology and cancer development 

Cohesin is a large ring-shaped protein complex consisting 
of four major subunits: SMC1A, SMC3, RAD21 and 
STAG1/2 [7] (Figure 1). While best known for its role in 
mediating sister chromatid cohesion from S phase until M 
phase [7], cohesin also plays crucial roles in DNA damage 
repair and gene expression [10], human development [11] 
and cancer [12,13]. Cohesin mutations are found in several 
cancer types [14-16], however their contribution to onco- 
genesis is unclear with both overexpression and mutation 
of cohesin subunits being implicated in cancer. For ex- 
ample, overexpression of cohesin subunit RAD21 in breast 
cancer is associated with poor prognosis and resistance to 
chemotherapy [17]. Cohesin mutations must necessarily 
lead to reduced, but not absent function, since complete 
loss of cohesin function blocks mitosis and results in cell 
death [18,19]. Therefore, the cohesin mutations found in 
cancer are usually heterozygous or hypomorphic. The 
mechanisms by which cohesin mutations contribute to can- 
cer probably involve multiple molecular pathways reflecting 
its non-mitotic molecular roles [12,13]. 
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Cohesin mutations in myeloid malignancies 

The association of cohesin mutations with myeloid ma- 
lignancy is particularly striking. Data from the Cancer 
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STAG1/STAG2 



Total cohesin: 9.1% 

Figure 1 Frequency of cohesin mutations in AML. Cohesin is a 
multi-subunit protein complex that is involved in chromosome 
pairing, DNA repair and transcription regulation. Mutations within 
the individual protein components of cohesin occur at significant 
frequency in AML. Data from references 20-28 were combined to 
determine the mutation frequency (boxes) in each of the cohesin 
subunits (SMC1A, SMC3, STAG 1/2, and RAD21). Details of mutations 
found in each study are presented in Table SI. 



Genome Atlas Research Network (TCGA) revealed that a 
significant proportion of AMLs had mutations in subunits 
of cohesin [20]. Somatic variants in cohesin genes were 
identified in 26/200 cases of AML subjected to exome or 
whole genome sequencing [20,21]. Sequencing of a separate 
set of AML samples by Welch et al. identified cohesin mu- 
tations in 7/108 cases [21]. Cohesin mutations occurred 
primarily in French-American-British (FAB) Ml and M2 
cases in the TCGA cohort, and Welch et al. found cohesin 
mutations exclusively in Ml cases [21]. The predominance 
of cohesin lesions in the most immature forms of AML 
suggests they were initiating events rather than passen- 
ger mutations [21]. Cohesin mutations co-occurred with 
NPM1, DNMT3A, TET2, or RUNX1 mutations in 17/19 
cases [21], implying cooperation with other leukemogenic 
pathways. Mutations in cohesin genes represented one of 
just nine categories of mutations thought to actively con- 
tribute to leukemogenesis [20] . Our calculation of the rate 
of cohesin mutations in AML using the TCGA data 
[20,21] and other published studies to date [22-28] indi- 
cates that the total rate of cohesin mutation in AML is 
around 9% (Figure 1, Table 1). Further details of the con- 
tributing studies can be found in the accompanying sup- 
plementary table (see Additional file 1: Table SI). 



The emergence of cohesin mutations in AML prompted 
Thol et al. [24] to sequence cohesin complex genes in 389 
AML samples, yielding a total of 23 mutations (5.9%). Mu- 
tations in cohesin subunits were mutually exclusive, and 
most mutations were found in karyotypically normal sam- 
ples. A strong correlation was observed between mutations 
in cohesin and the known AML-associated gene nucleo- 
phosmin (NPM1), with NPM1 -mutated patients twice as 
likely to also harbor a cohesin mutation compared with 
AZPMi-normal. Cohesin mutation status was not prognos- 
tically informative, nor did it correlate with any differences 
in clinical features. Allelic burden analysis suggested that 
cohesin mutations occurred as an early event during 
leukemogenesis [24]. 

While most evidence for cohesin mutations in mye- 
loid leukemia currently comes from AML, cohesin mu- 
tation is also implicated in related myeloid disorders. 
For example, Kon et al. [25] reported frequent muta- 
tions in cohesin components in a variety of myeloid 
neoplasms, including AML, myelodysplastic syndromes 
(MDS), chronic myelomonocytic leukemia (CMML), 
chronic myelogenous leukemia (CML) and classical my- 
eloproliferative neoplasms (MPN). Deep sequencing re- 
vealed that the majority of cohesin mutations existed in 
the major tumor populations, indicating they arose early 
in neoplasia. Strikingly, despite cohesin's known role in 
sister chromatid cohesion, myeloid malignancies with 
cohesin mutations were no more likely to be aneuploid 
than leukemias harboring other mutations [25]. Kon 
et al. conclude that, owing to their early origin and fre- 
quency in myeloid neoplasms, cohesin mutations ac- 
tively contribute to leukemogenesis [25] . 

Further evidence of cohesin's involvement in myeloid 
malignancies emerged from a recent study by Haferlach 
et al. showing that approximately 15% of patients with 
MDS harbor cohesin mutations [29]. The high proportion 
of cohesin mutations in MDS, combined with the fact that 
STAG2 and SMC1A mutations were significantly associ- 
ated with poor survival outcome, strongly suggests that 
cohesin mutation is central to the development and prog- 
nosis of MDS [29]. 

Yoshida et al. identified a striking association of cohesin 
mutation with another myeloid dysplasia, DS-AMKL [30]. 
Down's Syndrome (DS) patients can present with transient 
abnormal myelopoiesis (TAM) that is self-limiting in most 
cases. TAM is a myeloid proliferation resembling AML, 
and 10% of TAM progresses to non self-limiting acute 
megakaryoblastic leukemia (AMKL) in DS patients (DS- 
AMKL). Deep sequencing revealed that 53% of DS-AMKL 
samples had acquired cohesin mutations that were not 
found in somatic cells or the original TAM [30] . The high 
frequency of lesions in cohesin raises the strong possibility 
that cohesin mutation is instrumental to progression to 
DS-AMKL. 



Leeke ef al. Experimental Hematology & Oncology 2014, 3:13 
http://www.ehoonline.Org/content/3/1/13 



Page 3 of 8 



Table 1 Key findings from selected studies identifying cohesin mutations in myeloid malignancies 


Myeloid 
disorder(s) 


Cohesin mutation rate 


Rationale and key findings 


Ref # 


AML 


26/200 
\ I 37o; 


200 de novo AML samples were submitted to whole-genome/exome sequencing. The genes identified 
as recurrently mutated were grouped into nine functional categories important for AML: the cohesin 
complex was one such category. Cohesin complex mutations were mutually exclusive. 


[20] 


AML 


7/108 
(6.5%) 


Whole-genome sequencing (WGS) of 24 normal-karyotype M1 and M3 AML samples. Cohesin 
genes were only mutated in M1 samp es. Cohesin mutations were mutually exclusive and 
were not associated with chromosomal instability. 


[21] 


AML 


23/389 
(5.9%) 


Targeted sequencing of cohesin genes in 389 AML samples. Cohesin mutations significantly 
co-occurred with NPM1 mutations. Allelic burden analysis suggested cohesin mutations 
occurred early in AML. 


[24] 


AML 


23/197 
(11.7%) 


Targeted sequencing of 51 myeloid neoplasm candidate genes in 197 AML samples. 
Cohesin mutations were not associated with overall survival. 


[26] 


AML 


7/170 
(4.1%) 


Targeted sequencing of AML candidate loci in 50 AML samples. RAD21 mutations 
were present in all AML subtypes and were significantly associated with RAS mutations. 


[27] 


AML 
MDS 


12/158 
(7.6%) 


WGS of eight MDS and subsequent secondary-AML patient genomes. Targeted sequencing 
of 94 MDS/AML candidate loci. Each clone contained at least one mutation that recurs in 
MDS/AML. STAQ2 mutations significantly co-occurred with RUNX1 mutations. 


[28] 


MDS 


Approximately 15% 


Targeted sequencing of 104 MDS/AML candidate genes in 944 MDS samples. 47 genes 
were recurrently mutated in MDS. 14 of these genes (including STAG2) could successfully 
predict survival-outcome risk groups. 5TAG2 and SMC1A mutations were significantly 
associated with adverse patient outcome. 


[29] 


AML 
MDS 
CMML 


65/610 

\ I U. / /Oj 


Targeted sequencing of cohesin complex genes was undertaken in a cohort of 610 samples 
from various myeloid neoplasms. The core components of cohesin were significantly mutated. 
Cohesin mutations were present in the major tumor population in 15/20 available samples, 
indicating that cohesin mutations often occur as early events in oncogenesis. 


[25] 


CML 








MPN 








TAM 
AMKL 


39/86 
(45.3%) 


WGS of the genomes of TAM, AMKL, and DS-AMKL patients. Progression to DS-AMKL 
required acquisition of further mutations, including RAD21, 5TAG2, NRA5, CTCF, EZH2, and TP53. 


[30] 


DS-AMKL 




Cohesin mutations were present at a much higher rate in DS-AMKL than AMKL. 
Allelic burden analysis suggested that cohesin mutations occurred early in DS-AMKL. 





Despite the prevalence of cohesin mutations in myeloid 
dysplasia, the exact mechanism by which cohesin lesions 
contribute to cancer development is unclear. Accumulating 
evidence argues that cohesin mutation is an early event in 
myeloid oncogenesis. Welch et al. and TCGA showed that 
cohesin mutations mainly occur in the most immature 
AML subtypes [20,21]; clonal analysis by Kon et al. [25] 
and allelic burden analysis by Thol et al. [24] suggest that 
cohesin mutations occur as early events in leukemogenesis. 
What is the mechanism by which these mutations lead to 
cancer? In solid tumors with cohesin mutations, chromo- 
some instability and aneuploidy have been suggested as the 
mechanisms by which cohesin mutation facilitates neopla- 
sia [16,31,32], although other evidence argues against 
this idea [33]. For myeloid malignancies, a clear theme is 
emerging: heterozygous cohesin mutations do not cause 
chromosome instability [21,22,24,25,30] (for details, see 
Additional file 1: Table SI). This suggests that, at least in 
myeloid cancers, it is cohesin's non-mitotic roles that con- 
tribute to oncogenesis. 



Cohesin regulates gene transcription 

Cohesin's role in gene expression has been intensively in- 
vestigated over the last 15 years [10]. Several examples of 
cohesin-dependent gene regulation have been found, in- 
cluding regulatory roles at developmental genes [5] and in 
stem cells [34] . One of the potential mechanisms by which 
cohesin regulates gene transcription is through mediating 
long-range communication events that form DNA loops, 
which regulate transcription [6]. Enhancers (which pro- 
mote transcription) and insulators (which usually block 
transcription) are located in conserved regulatory elements 
(CREs) on chromosomes, and need not be close to the 
gene(s) they regulate. Cohesin is thought to physically 
connect distant CREs with gene promoters, in a cell type- 
specific manner, to modulate transcriptional outcomes [6] 
(Figure 2). Therefore, mutations in cohesin could impede 
cohesin binding to CREs, thereby altering their interaction 
with promoters, and subsequently gene activity. Similarly, 
mutations in the CREs that affect cohesin binding could 
alter transcription of the gene target(s) of that CRE. 
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cohesin 





CRE/Promoters 



Figure 2 Cohesin regulates gene expression by controlling CRE-promoter interactions. CREs can regulate gene expression by physically 
contacting a promoter, but are often located at a distance (tens of kilobases and sometimes megabases) from the promoter. Cohesin is involved 
in the establishment and maintenance of CRE-promoter interactions and can thereby control gene expression. Loss of cohesin can lead to loss of 
CRE-promoter interactions, resulting in inappropriate gene repression, or gene activation. 



In addition to connecting CREs with promoters, cohesin 
has an important role in organizing global genomic archi- 
tecture. Cohesin binding of DNA together with CCCTC- 
binding factor (CTCF) helps to partition the genome into 
megabase-sized regions known as fopologically associated 
domains (TADs) [35-37]. TADs are demarcated by bound- 
aries that are characterized by the presence of cohesin and 
CTCF, housekeeping genes, tRNAs and short interspersed 
element (SINE) retrotransposons [35]. Within TADs are 
regions of local chromosome interactions, which allow 
CREs to come into physical proximity with gene pro- 
moters to modulate gene expression [38-40]. While TAD 
boundaries are conserved between cell types, the chromo- 
some interactions within TADs vary, and provide a means 
for enabling cell type-specific transcription [35,38,40]. 

Although cohesin and CTCF frequently colocate on chro- 
mosomes [41-43], they appear to have distinct roles in gen- 
ome architecture [40,44,45]. Cohesin influences gene 
expression by coordinating interactions between CREs and 
promoters within TADs [38,40], while CTCF is important 
for preventing interactions between TADs [40]. Cohesin de- 
ficiency reduces the number of chromosome interactions 
within TADs and leads to altered expression of many genes 
[38-40]: different genes to those dysregulated upon CTCF 
depletion [40]. Because of the cell type specificity of CRE- 
promoter interactions within TADs, cohesin deficiency 
could result in an abnormal transcriptional profile for a 
particular tissue type (Figure 3). Moreover, there are sev- 
eral genomic sites where cohesin binds exclusively of 
CTCF, in combination with tissue-specific transcription 
factors [44,45]. For example, in mouse primary liver cells 
and hepatocellular carcinoma cells (HepG2), CTCF- 
independent cohesin binding sites are associated with ex- 
pression of liver-specific genes [44]. 

Strikingly, only a modest reduction in chromatin-bound 
cohesin is sufficient to cause changes in gene expression 
[46]. In human cells and mice, heterozygous mutations in 
the cohesin-loading factor NIPBL or in cohesin subunit 
SMC1A affect the expression of numerous genes [47,48]. 



In Drosophila, halving the gene dose of cohesin compo- 
nents robustly affects gene expression [49,50]. Therefore, 
leukemias with heterozygous cohesin mutations are also 
likely to be affected by the dysregulation of many genes. 

Altered cohesin function has potential to perturb 
hematopoietic gene expression 

A number of hematopoietic transcription factors are regu- 
lated by cohesin binding to CREs or promoters. For ex- 
ample, the hematopoietic transcription factor TALI is 
regulated at the transcriptional level by a chromatin hub 
containing cohesin and CTCF [51]. In further examples, 
the GATA2 gene contains an intronic +9.5 kb enhancer 
that is important for its expression [52], while the +85 ERG 
stem cell enhancer contains binding sites for a heptad of 
hematopoietic transcription factors and is thought to 
propagate a hematopoietic stem cell-like transcription pro- 
file [53]. Our survey of publically available ENCODE data 
revealed that cohesin binds both the GATA2 and ERG en- 
hancers in hematopoietic cells (K562). Therefore, it is pos- 
sible that cohesin mutation could alter the activity of these 
enhancers and their target genes in a leukemogenic setting. 

RUNX1 transcription is altered by cohesin deficiency 

The developmental transcription factor RUNX1 plays a 
particularly important role in myeloid malignancies. 
RUNX1 function is central to early myeloid differentiation 
and is absolutely required for definitive hematopoiesis 
[54,55]. RUNX1 is involved in chromosomal transloca- 
tions, such as t(12;21) in acute lymphoblastic leukemia in 
childhood and t(8;21) in acute myeloid leukemia, and is 
also targeted by point mutations and deletions [56]. 
Leukemic alterations of RUNX1 lead to abnormal protein 
function and thus dysregulation of RUNX1 target genes. 
The importance of RUNX1 function in hematopoiesis and 
leukemia has generated great interest in determining the 
factors that regulate its expression. 

It is interesting that DS-AMKL leukemias contain three 
copies of the RUNX1 gene (owing to trisomy 21), as well 



Leeke ef al. Experimental Hematology & Oncology 2014, 3:13 
http://www.ehoonline.Org/content/3/1/13 



Page 5 of 8 



Cohesin-mediated chromatin interactions 
occur within TADs 




DNA 

Cohesin 
CRE/Promoters 



AML 




Disruption of local 
CRE-promoter interactions 



Global loss of interactions 
within TADs 



Dysregulated expression I 
of hematopoietic genes e.g. RUNX1 

Figure 3 Model for cohesin's role in AML and other myeloid malignancies. Cohesin has an important function in the nucleus: it mediates 
chromosome interactions within topologically associated domains (TADs). Within TADs, cohesin connects conserved regulatory elements (CREs) with 
promoters, thereby regulating gene transcription. When cohesin function is compromised by a heterozygous mutation, as in AML, this leads to loss of 
CRE-promoter communication at specific hematopoietic genes, such as RUNX1. The result is dysregulation of hematopoietic transcription programs, 
which could facilitate the development of AML. In addition, loss of tissue-specific sub-domain structures affects the global hematopoietic 
transcription program. 



as having a remarkably high frequency of cohesin muta- 
tion (53%) [30]. Evocatively, data from zebrafish provided 
the first evidence that cohesin regulates tissue-specific 
Runxl transcription. In developing zebrafish embryos, a 
null mutation in the rad21 subunit of cohesin blocked 
runxl expression in hematopoietic mesoderm, but not in 
Rohon-Beard neurons [57]. That cohesin ablation affected 
hematopoietic progenitors, but not neurons, indicates that 
the transcriptional role of cohesin is tissue-specific in 
hematopoietic precursors. 

In mouse, a CRE enhancer resides in an intron between 
the PI (distal) and P2 (proximal) promoters of Runxl. 
This enhancer, termed +23 [58] or alternatively, +24 [59], 
is active only in precursors of hematopoietic stem cells 
where Runxl is endogenously expressed [58,59]. Cohesin 
subunit Rad21 binds the Runxl + 23/24 mouse enhancer 
region, which is also conserved in human [60]. ENCODE 
data from the leukemia K562 cell line indicates that the 
equivalent human CRE/enhancer also recruits cohesin 
subunits, together with CTCF [60] . 

In zebrafish, Marsman et al. showed that cohesin deple- 
tion altered the activity of intronic runxl CREs [60]. Mul- 
tiple binding sites were identified for cohesin and CTCF in 
the zebrafish runxl gene, coinciding with active CREs in 
the intron between PI and P2. Cohesin and CTCF deter- 
mine the spatial distribution of runxl transcripts in the 
zebrafish embryo at the onset of runxl expression, likely 



by controlling intronic CRE activity and CRE-promoter in- 
teractions. CTCF appears to restrict the expression pattern 
of runxl, consistent with insulator activity, while cohesin 
is necessary for its expression in a specific subpopulation 
of hematopoietic progenitors [57,60] . 

Interestingly, Marsman et al. also showed that siRNA 
knock down of cohesin (but not CTCF) in HL-60 myelo- 
cytic leukemia cells enhanced RUNX1 transcription [60], 
indicating that cohesin's transcriptional role is conserved 
in human cells. It is tempting to speculate that cohesin 
mutation leading to an increase in RUNX1 transcription 
might exacerbate myeloid malignancies that already have 
excess RUNX1; for example, DS-AMKL [30]. 

In summary, it appears that cohesin has a crucial role in 
cell type-specific regulation of Runxl, likely by mediating 
interactions between CREs and the promoters of Runxl. 
In support of this idea, ChlA-PET data generated in K562 
cells using RNA polymerase II demonstrated that the two 
promoters of human RUNX1 are in physical proximity 
with each other, and with CREs in the intron between the 
two promoters [61]. It is not yet known whether these in- 
teractions regulate RUNX1, or whether they are cohesin- 
dependent. While formal proof of this kind of mechanism 
for cohesin regulation of Runxl is still to come, the link 
between cohesin mutation and spatiotemporal Runxl 
transcription may explain cohesin's contribution to AML 
pathogenesis and other myeloid malignancies. 
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Conclusions 

Mutations in cohesin comprise a novel genetic pathway 
significantly associated with the development of AML and 
related leukemias. While several types of cancer do have 
cohesin mutations, most cancers also harbor many add- 
itional mutations in multiple gene categories [15]. By con- 
trast, AML genomes contain relatively fewer mutations 
than other cancer types, with only 23 genes significantly 
mutated [20]. Four of these genes correspond to cohesin 
subunits [15,20], indicating that cohesin mutations are 
particularly important to the progression of AML. 

Why do myeloid disorders have a high prevalence of 
cohesin mutations in particular? The answer could reside 
in cohesin's potential to mediate global transcriptional ac- 
tivity in a way that is also exquisitely cell type-specific. 

Evidence that cohesin regulates cell type-specific global 
gene transcriptional programs, and in particular, expres- 
sion of the AML-associated transcription factor, RUNX1, 
could explain why cohesin mutations are so prevalent in 
myeloproliferative disorders. Perhaps correct differenti- 
ation along the myeloid pathway relies on accurate expres- 
sion of key genes (such as RUNX1) that can only respond 
to a full complement of cohesin. When cohesin function is 
impaired, differentiation of myeloid precursors might be 
prevented, facilitating dysplasia. These notions support 
previous hypotheses that cohesin is likely to play an im- 
portant role in hematopoiesis [57,62] . 

Remarkably, cohesin binds to a majority of cell type- 
specific transcription factor binding sites, even when 
transcription factors themselves are evicted during mi- 
tosis [63]. In this manner, cohesin binding may 'book- 
mark' transcription factor binding sites to re-establish 
transcriptional programs after cell division [63], includ- 
ing sites for hematopoietic transcription factors. 

Further research will be necessary to understand 
exactly how cohesin functions in normal and abnormal 
hematopoiesis, and how cohesin mutations cooperate 
with other genetic events to progress leukemia. 

Additional file 



Additional file 1 : Table SI . Comprehensive summary of cohesin 
mutations in myeloid malignancies. 
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